1 Introduction

The problem of an unknown common change in the means of panels is studied
here, where the panel data consist of N panels and each panel contains T
observations over time. The size of the change may differ from panel to
panel, but it occurs at some unknown common time $\tau \in \{1, \ldots, T\}$.
The panels are considered to be independent, although this restriction can be
weakened. Observations within a panel, on the other hand, are usually not
independent. It is supposed that a common unknown dependence structure is
present over the panels.

Tests for change point detection in panel data have been proposed only for
the case when the panel size T is sufficiently large, i.e., T tends to
infinity from an asymptotic point of view, cf. [1] or [5]. However, change
point estimation has already been studied for finite T not depending on the
number of panels N, see [5]. The remaining task is to develop testing
procedures that decide whether a common change point is present in the
panels, while taking into account that the length T of each observation
regime is fixed and can be relatively small.


1.1 Motivation 

Structural changes in panel data, especially common breaks in means, are
widespread phenomena. Our primary motivation comes from the non-life
insurance business, where associations in many countries uniting several
insurance companies collect claim amounts paid by every insurance company
each year. Such a database of cumulative claim payments can be viewed as
panel data, where insurance company $i = 1, \ldots, N$ provides the total
claim amount $Y_{i,t}$ paid in year $t = 1, \ldots, T$ into the common
database. The members of the association can consequently profit from the
joint database.

For the whole association it is important to know whether a possible change
in the claim amounts occurred during the observed time horizon. Usually, the
time period is relatively short, e.g., 10-15 years. To be more specific,
a widely used and very standard actuarial method for predicting future claim
amounts, called chain ladder, assumes a kind of stability of the historical
claim amounts. The formal necessary and sufficient condition is derived in
[?]. This paper shows how to test for a possible historical instability.


2 Panel change point model 

Let us consider the panel change point model

$$Y_{i,t} = \mu_i + \delta_i \mathbb{1}\{t > \tau\} + \sigma \varepsilon_{i,t}, \quad 1 \le i \le N,\ 1 \le t \le T; \qquad (1)$$

where $\sigma > 0$ is an unknown variance-scaling parameter and T is fixed,
not depending on N. The possible common change point time is denoted by
$\tau \in \{1, \ldots, T\}$. A situation where $\tau = T$ corresponds to no
change in means of the panels. The means $\mu_i$ are panel-individual. The
amount of the break in mean, which can also differ for every panel, is
denoted by $\delta_i$. Furthermore, it is assumed that the sequences of
panel disturbances $\{\varepsilon_{i,t}\}_t$ are independent and within each
panel the errors form a weakly stationary sequence with a common correlation
structure. This can be formalized in the following assumption.

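Assumption A1 The vectors $[\varepsilon_{i,1}, \ldots, \varepsilon_{i,T}]^\top$
existing on a probability space $(\Omega, \mathcal{F}, \mathsf{P})$ are iid
for $i = 1, \ldots, N$ with $\mathsf{E}\varepsilon_{i,t} = 0$ and
$\mathsf{Var}\,\varepsilon_{i,t} = 1$, having the autocorrelation function

$$\rho_t = \mathsf{Corr}(\varepsilon_{i,s}, \varepsilon_{i,s+t}) = \mathsf{Cov}(\varepsilon_{i,s}, \varepsilon_{i,s+t}), \quad \forall s \in \{1, \ldots, T-t\},$$

which is independent of the lag s, the cumulative autocorrelation function

$$r(t) = \mathsf{Var} \sum_{s=1}^{t} \varepsilon_{i,s} = \sum_{|s|<t} (t - |s|) \rho_s,$$

and the shifted cumulative correlation function

$$R(t,v) = \mathsf{Cov}\left( \sum_{s=1}^{t} \varepsilon_{i,s}, \sum_{u=t+1}^{v} \varepsilon_{i,u} \right) = \sum_{s=1}^{t} \sum_{u=t+1}^{v} \rho_{u-s}, \quad t < v,$$

for all $i = 1, \ldots, N$ and $t, v = 1, \ldots, T$.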

The sequence $\{\varepsilon_{i,t}\}_{t=1}^{T}$ can be viewed as a part of
a weakly stationary process. Note that the dependent errors within each panel
do not necessarily need to be linear processes. For example, GARCH processes
as error sequences are allowed as well. The assumption of independent panels
can indeed be relaxed, but it would make the setup much more complex, since
probabilistic tools for dependent data would then be needed (e.g., suitable
versions of the central limit theorem). Nevertheless, assuming that the claim
amounts for different insurance companies are independent is reasonable.
Moreover, the assumption of a common homoscedastic variance parameter
$\sigma$ can be generalized by introducing weights $w_{i,t}$, which are
supposed to be known. In actuarial practice in particular, this would mean
normalizing the total claim amount by the premium received, since bigger
insurance companies are expected to have higher variability in total claim
amounts paid.
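
To make model (1) concrete, the following minimal Python sketch draws one panel data set from it. It is only an illustration under stated assumptions: the errors are taken to be iid standard normal within each panel (the simplest case allowed by Assumption A1), and the function name `simulate_panel` and all parameter values are hypothetical choices, not part of the paper.

```python
import random

def simulate_panel(N, T, tau, mu, delta, sigma, rng):
    """Draw Y[i][t-1] = mu[i] + delta[i] * 1{t > tau} + sigma * eps_{i,t}.

    Assumption for this sketch: eps_{i,t} are iid N(0, 1), i.e. there is
    no dependence within a panel. Setting tau = T gives the no-change case.
    """
    return [
        [mu[i] + (delta[i] if t > tau else 0.0) + sigma * rng.gauss(0.0, 1.0)
         for t in range(1, T + 1)]
        for i in range(N)
    ]

rng = random.Random(1)
N, T, tau = 50, 10, 5
# Hypothetical parameter values chosen only for illustration.
Y = simulate_panel(N, T, tau, mu=[1.0] * N, delta=[2.0] * N, sigma=0.1, rng=rng)
```

Each row of `Y` is one panel; the jump of size `delta[i]` affects the observations at times $\tau + 1, \ldots, T$ only.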

It is required to test the null hypothesis of no change in the means

$$H_0: \tau = T$$

against the alternative that at least one panel has a change in mean

$$H_1: \tau < T \ \text{and}\ \exists i \in \{1, \ldots, N\}: \delta_i \neq 0.$$


3 Test statistic and asymptotic results


We propose a ratio type statistic to test $H_0$ against $H_1$, because this
type of statistic does not require estimation of the nuisance parameter for
the variance: the variance parameter simply cancels out from the numerator
and denominator of the statistic. Admittedly, the common variance could be
estimated from all the panels, of which we possess a sufficient number.
Nevertheless, we aim to construct a valid and completely data driven testing
procedure that avoids plugging estimates in place of nuisance parameters.
A bootstrap add-on serves this purpose, as is seen later on.

For surveys on ratio type test statistics, we refer to [5], [6], [7], [13],
and [14]. Our particular panel change point test statistic is


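$$\mathcal{R}_N(T) = \max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| \sum_{i=1}^{N} \sum_{r=1}^{s} \left( Y_{i,r} - \bar{Y}_{i,t} \right) \right|}{\max_{s=t,\ldots,T-1} \left| \sum_{i=1}^{N} \sum_{r=s+1}^{T} \left( Y_{i,r} - \tilde{Y}_{i,t} \right) \right|},$$

where $\bar{Y}_{i,t}$ is the average of the first t observations in panel i
and $\tilde{Y}_{i,t}$ is the average of the last T - t observations in panel
i, i.e.,

$$\bar{Y}_{i,t} = \frac{1}{t} \sum_{s=1}^{t} Y_{i,s} \quad \text{and} \quad \tilde{Y}_{i,t} = \frac{1}{T-t} \sum_{s=t+1}^{T} Y_{i,s}.$$
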
An alternative way of testing for a change in panel means could be to use
CUSUM type statistics, for example, a maximum or minimum of a sum (not
a ratio) of properly standardized or modified sums from our test statistic
$\mathcal{R}_N(T)$. The theory that follows can be appropriately rewritten
for such cases.
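
The ratio type statistic $\mathcal{R}_N(T)$ can be evaluated directly from its definition. The sketch below is one possible implementation, not the authors' code; the function name `ratio_statistic` is hypothetical. It uses the fact that the double sums are linear in the observations, so that $\sum_i \sum_{r \le s} (Y_{i,r} - \bar{Y}_{i,t}) = P_s - (s/t) P_t$, where $P_s$ is the sum over all panels of the first s observations.

```python
def ratio_statistic(Y):
    """Ratio type statistic R_N(T) for panel data Y (list of N rows of length T)."""
    N, T = len(Y), len(Y[0])
    # P[s] = sum over panels of the first s observations, with P[0] = 0.
    P = [0.0]
    for r in range(T):
        P.append(P[-1] + sum(Y[i][r] for i in range(N)))
    best = 0.0
    for t in range(2, T - 1):  # t = 2, ..., T-2
        # Numerator: partial sums centered by the mean of the first t observations.
        num = max(abs(P[s] - s / t * P[t]) for s in range(1, t + 1))
        # Denominator: tail sums centered by the mean of the last T-t observations.
        den = max(abs((P[T] - P[s]) - (T - s) / (T - t) * (P[T] - P[t]))
                  for s in range(t, T))
        best = max(best, num / den)  # den is zero only for degenerate data
    return best

# Tiny hand-checkable example with a single panel (N = 1, T = 6).
value = ratio_statistic([[1.0, 2.0, 0.0, 3.0, 1.0, 4.0]])
```

Because the unknown scale $\sigma$ multiplies both numerator and denominator, rescaling or shifting all observations leaves the returned value unchanged, which is the practical reason for preferring the ratio form.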

Firstly, we derive the behavior of the test statistic under the null
hypothesis.


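Theorem 1 (Under null) Under hypothesis $H_0$ and Assumption A1,

$$\mathcal{R}_N(T) \xrightarrow[N \to \infty]{\mathscr{D}} \max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| X_s - \frac{s}{t} X_t \right|}{\max_{s=t,\ldots,T-1} \left| Z_s - \frac{T-s}{T-t} Z_t \right|},$$

where $Z_t := X_T - X_t$ and $[X_1, \ldots, X_T]^\top$ is a multivariate
normal random vector with zero mean and covariance matrix
$\Lambda = \{\lambda_{t,v}\}_{t,v=1}^{T,T}$ such that

$$\lambda_{t,t} = r(t) \quad \text{and} \quad \lambda_{t,v} = r(t) + R(t,v), \quad t < v.$$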


The limiting distribution does not depend on the variance nuisance parameter
$\sigma$, but it depends on the unknown correlation structure of the panel
change point model, which has to be estimated for testing purposes. The way
of its estimation is shown in Subsection 4.1. Furthermore, Theorem 1 is just
a theoretical mid-step for the bootstrap test, where the correlation
structure need not be known. That is why the presence of unknown quantities
in the asymptotic distribution is not troublesome.

Note that in case of independent observations within the panel, the
correlation structure and, hence, the covariance matrix $\Lambda$ is
simplified such that $r(t) = t$ and $R(t,v) = 0$.
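
In the independent case just mentioned, the vector $[X_1, \ldots, X_T]^\top$ has covariances $\min(t, v)$, so it can be generated as partial sums of iid standard normal variables. The following sketch, given only as an illustration, simulates the limiting functional of Theorem 1 under this assumption and reads off an approximate critical value; the function names, the number of draws, and the seed are hypothetical choices.

```python
import random

def limit_functional(X):
    """Ratio functional from Theorem 1 for X = [X_1, ..., X_T]."""
    T = len(X)
    P = [0.0] + list(X)  # P[t] = X_t, with the convention X_0 = 0
    best = 0.0
    for t in range(2, T - 1):  # t = 2, ..., T-2
        num = max(abs(P[s] - s / t * P[t]) for s in range(1, t + 1))
        Zt = P[T] - P[t]  # Z_t = X_T - X_t
        den = max(abs((P[T] - P[s]) - (T - s) / (T - t) * Zt)
                  for s in range(t, T))
        best = max(best, num / den)
    return best

def simulate_critical_value(T, n_draws, alpha, rng):
    """Empirical (1 - alpha) quantile of the limit, independent case only."""
    draws = []
    for _ in range(n_draws):
        X, total = [], 0.0
        for _ in range(T):
            total += rng.gauss(0.0, 1.0)  # gives Cov(X_t, X_v) = min(t, v)
            X.append(total)
        draws.append(limit_functional(X))
    draws.sort()
    return draws[min(n_draws - 1, int((1 - alpha) * n_draws))]

crit = simulate_critical_value(T=10, n_draws=2000, alpha=0.05,
                               rng=random.Random(0))
```

For dependent errors the same functional applies, but the vector would have to be drawn from a multivariate normal distribution with the (estimated) covariance matrix $\Lambda$ instead of from partial sums.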

Next, we show how the test statistic behaves under the alternative. 


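Assumption A2 $\displaystyle \lim_{N \to \infty} \frac{1}{\sqrt{N}} \left| \sum_{i=1}^{N} \delta_i \right| = \infty$.


Theorem 2 (Under alternative) If $\tau \le T - 3$, then under Assumptions A1,
A2 and alternative $H_1$

$$\mathcal{R}_N(T) \xrightarrow[N \to \infty]{\mathsf{P}} \infty. \qquad (2)$$
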


Assumption A2 is satisfied, for instance, if $0 < \delta \le \delta_i$ for
all i (a common lower change point threshold) and
$\delta \sqrt{N} \to \infty$ as $N \to \infty$. Another suitable example of
$\delta_i$'s for the condition in Assumption A2 can be
$0 < \delta_i = K N^{-1/2+\eta}$ for some $K > 0$ and $\eta > 0$. Or
$\delta_i = C i^{\alpha-1} \sqrt{N}$ may be used as well, where $\alpha > 0$
and $C > 0$. The assumption $\tau \le T - 3$ means that there are at least
three observations in the panel after the change point. It is also possible
to redefine the test statistic by interchanging the numerator and the
denominator of $\mathcal{R}_N(T)$. Afterwards, Theorem 2 for the modified
test statistic would require three observations before the change point,
i.e., $\tau \ge 3$.

Theorem 2 says that in the presence of a structural change in the panel
means, the test statistic explodes above all bounds. Hence, the procedure is
consistent and the asymptotic distribution from Theorem 1 can be used to
construct the test.


4 Change point estimation 

Despite the fact that the aim of the paper is to establish testing procedures
for detection of a panel mean change, it is necessary to construct
a consistent estimate for a possible change point. There are two reasons for
that. Firstly, the estimation of the covariance matrix $\Lambda$ from
Theorem 1 requires panels as vectors with elements having a common mean
(i.e., without a jump). Secondly, the bootstrap procedure, introduced later
on, requires centered residuals to be resampled.

A consistent estimate of the change point in the panel data is proposed
in [5], but under the assumption that a change occurred for sure. In our
situation, we do not know whether a change occurs or not. Therefore, we
modify the estimate proposed by [2] in the following way. If the panel means
change somewhere inside $\{2, \ldots, T-1\}$, let the estimate consistently
select this change. If there is no change in panel means, the estimate points
out the very last time point T with probability going to one. In other words,
the value of the change point estimate can be T, meaning no change. This is
in contrast with [2], where T is not reachable.

Let us define the estimate of $\tau$ as

$$\hat{\tau}_N := \arg\min_{t=2,\ldots,T} \frac{1}{w(t)} \sum_{i=1}^{N} \sum_{s=1}^{t} \left( Y_{i,s} - \bar{Y}_{i,t} \right)^2,$$

where $\{w(t)\}_{t=2}^{T}$ is a sequence of weights specified later on.


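Assumption C1 The sequence $\left\{ \frac{t}{w(t)} \left( 1 - \frac{r(t)}{t^2} \right) \right\}_{t=2}^{T}$ is decreasing.


Assumption C2 There exist constants $L > 0$ and $N_0 \in \mathbb{N}$ such that

$$L < \sigma^2 \left[ \frac{t}{w(t)} \left( 1 - \frac{r(t)}{t^2} \right) - \frac{\tau}{w(\tau)} \left( 1 - \frac{r(\tau)}{\tau^2} \right) \right] + \frac{\tau (t - \tau)}{t\, w(t)} \frac{1}{N} \sum_{i=1}^{N} \delta_i^2 \qquad (3)$$

for each $t = \tau + 1, \ldots, T$ and $N \ge N_0$.

Assumption C3 $\displaystyle \lim_{N \to \infty} \frac{1}{N^2} \sum_{i=1}^{N} \delta_i^2 = 0$.

Assumption C4 $\mathsf{E}\varepsilon_{1,t}^4 < \infty$, $t \in \{1, \ldots, T\}$.

Theorem 3 (Change point estimate consistency) Suppose that $\tau \neq 1$.
Then under Assumptions A1, C1, C2, C3, and C4

$$\lim_{N \to \infty} \mathsf{P}[\hat{\tau}_N = \tau] = 1.$$
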


Assumption C2 assures that the values of changes have to be large enough
compared to the variability of the random noise in the panels and to the
strength of dependencies within the panels as well. Assumption C3 is needed
to control the asymptotic boundedness of the variability of
$\frac{1}{N} \sum_{i=1}^{N} \sum_{s=1}^{t} (Y_{i,s} - \bar{Y}_{i,t})^2$,
because a finite T cannot do that.

Assumptions C2 and C3 are satisfied for $0 < \delta \le \delta_i < \Delta$
for all i (a common lower and upper bound for the change amount) and suitable
$\sigma$, $r(t)$, and $w(t)$. The monotonicity Assumption C1 is not very
restrictive at all. For example, in case of independent observations within
the panel (i.e., $r(t) = t$) and weight function $w(t) = t^q$, $q \ge 2$,
this assumption is automatically fulfilled, since the sequence
$\{t^{1-q} - t^{-q}\}_{t=2}^{T}$ is decreasing. This also gives us an idea
how to choose weights $w(t)$.
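
The change point estimate is a plain minimization over $t = 2, \ldots, T$ and can be sketched in a few lines. The code below is an illustrative implementation, not the authors' own; the function name `estimate_change_point` is hypothetical, and the default weight $w(t) = t^2$ is simply the choice $q = 2$ from the example above.

```python
import random

def estimate_change_point(Y, w=lambda t: t ** 2):
    """Return argmin over t = 2..T of (1/w(t)) * sum_i sum_{s<=t} (Y_is - mean_t)^2."""
    T = len(Y[0])
    best_t, best_val = None, float("inf")
    for t in range(2, T + 1):
        total = 0.0
        for row in Y:
            head = row[:t]
            m = sum(head) / t  # average of the first t observations
            total += sum((y - m) ** 2 for y in head)
        val = total / w(t)
        if val < best_val:  # ties are resolved in favor of the smaller t
            best_t, best_val = t, val
    return best_t

# Illustration with simulated data (iid normal noise is an assumption here):
rng = random.Random(0)
Y_change = [[(2.0 if t > 5 else 0.0) + 0.1 * rng.gauss(0.0, 1.0)
             for t in range(1, 11)] for _ in range(2000)]
tau_hat = estimate_change_point(Y_change)
```

A returned value equal to T is to be read as "no change", in line with the construction of the estimate.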

If one is interested in the sensitivity of the change point estimate (i.e.,
what size of change can be estimated), let us consider the following model
scenario: $T = 10$, $\tau = 5$, $\sigma = 0.1$, independent observations
within the panel, and $w(t) = t^2$. Then, Assumption C2 is satisfied if
$\frac{1}{N} \sum_{i=1}^{N} \delta_i^2 > 0.029$ for all $N \ge N_0$. In case
of a common value of $\delta = \delta_i$ for all i, we need
$\delta > \sqrt{0.029} \approx 0.170$.
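
The bound in this scenario can be checked numerically. The sketch below assumes that inequality (3) has the form $L < \sigma^2 [g(t) - g(\tau)] + \frac{\tau(t-\tau)}{t\,w(t)} \frac{1}{N}\sum_i \delta_i^2$ with $g(t) = \frac{t}{w(t)}(1 - r(t)/t^2)$, and solves it for the average squared change at every $t = \tau+1, \ldots, T$; the function name `c2_threshold` is hypothetical.

```python
def c2_threshold(T, tau, sigma, r, w):
    """Smallest average squared change for which the right-hand side of (3)
    is positive for all t = tau+1, ..., T (form of (3) assumed as above)."""
    def g(t):
        return t / w(t) * (1.0 - r(t) / t ** 2)
    return max(
        sigma ** 2 * (g(tau) - g(t)) * t * w(t) / (tau * (t - tau))
        for t in range(tau + 1, T + 1)
    )

bound = c2_threshold(T=10, tau=5, sigma=0.1,
                     r=lambda t: t,        # independent observations
                     w=lambda t: t ** 2)   # weight function from the scenario
```

Under these assumptions the binding constraint is $t = T = 10$ and the computed value is about 0.028, slightly below the 0.029 quoted above; the small gap may stem from rounding or from the strictly positive constant L.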

Assumption C2 can be considered as too complicated. Therefore, one can
replace it by the following simpler, but more restrictive assumption.


Assumption C5 $\displaystyle \lim_{N \to \infty} \frac{1}{N} \sum_{i=1}^{N} \delta_i^2 = \infty$.











Testing Structural Changes in Panel Data with Small Fixed Panel Size 




On one hand, this assumption might be considered as too strong, because
a common fixed (not depending on N) value of $\delta = \delta_i$ for all i
does not fulfill Assumption C5. On the other hand, Assumption C5 is satisfied
when $\delta_j^2 / N \to \infty$ as $N \to \infty$ for some
$j \in \mathbb{N}$ and $\delta_i = 0$ for all $i \neq j$. This stands for
a situation when none of the panels changes in mean except one panel, which
has a sufficiently large change in mean with respect to the number of panels.

Various competing consistent estimates of a possible change point can be
suggested, e.g., the maximizer of […]. To show the consistency, one needs to
postulate different assumptions on the cumulative autocorrelation function
and the shifted cumulative correlation function compared to Theorem 3, and
this may be rather complex.


4.1 Estimation of the correlation structure 


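Since the panels are considered to be independent and the number of panels
may be sufficiently large, one can estimate the correlation structure of the
errors $[\varepsilon_{1,1}, \ldots, \varepsilon_{1,T}]^\top$ empirically. We
base the errors' estimates on residuals

$$\hat{e}_{i,t} := \begin{cases} Y_{i,t} - \bar{Y}_{i,\hat{\tau}_N}, & t \le \hat{\tau}_N, \\ Y_{i,t} - \tilde{Y}_{i,\hat{\tau}_N}, & t > \hat{\tau}_N. \end{cases}$$

Then, the empirical version of the autocorrelation function is

$$\hat{\rho}_t := \frac{1}{\hat{\sigma}^2 N T} \sum_{i=1}^{N} \sum_{s=1}^{T-t} \hat{e}_{i,s} \hat{e}_{i,s+t}. \qquad (4)$$

Consequently, the kernel estimation of the cumulative autocorrelation
function and shifted cumulative correlation function is adopted in line
with [1]:

$$\hat{r}(t) = \sum_{|s|<t} (t - |s|)\, \kappa\!\left( \frac{s}{h} \right) \hat{\rho}_s,$$

$$\hat{R}(t,v) = \sum_{s=1}^{t} \sum_{u=t+1}^{v} \kappa\!\left( \frac{u-s}{h} \right) \hat{\rho}_{u-s}, \quad t < v;$$

where $h > 0$ stands for the window size and $\kappa$ belongs to a class of
kernels given by

$$\left\{ \kappa(\cdot): \mathbb{R} \to [-1,1] \ \Big|\ \kappa(0) = 1,\ \kappa(x) = \kappa(-x)\ \forall x,\ \int_{-\infty}^{+\infty} \kappa^2(x)\, dx < \infty,\ \kappa(\cdot) \text{ is continuous at 0 and at all but a finite number of other points} \right\}.$$

Since the variance parameter $\sigma$ is not present in the limiting
distribution of Theorem 1, it neither has to be estimated nor known.
Nevertheless, one can use
$\hat{\sigma}^2 := \frac{1}{NT} \sum_{i=1}^{N} \sum_{s=1}^{T} \hat{e}_{i,s}^2$.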
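
The estimation steps of this subsection can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: the residuals are taken to be the observations centered by the segment means before and after the estimated change point, the kernel is the standard Parzen kernel (used later in the simulations), and all function names are hypothetical.

```python
def parzen(x):
    """Standard Parzen kernel: symmetric, equal to 1 at 0, zero outside [-1, 1]."""
    a = abs(x)
    if a <= 0.5:
        return 1.0 - 6.0 * a ** 2 + 6.0 * a ** 3
    if a <= 1.0:
        return 2.0 * (1.0 - a) ** 3
    return 0.0

def residuals(Y, tau_hat):
    """Center each panel by its mean up to tau_hat and by its mean after tau_hat."""
    out = []
    for row in Y:
        head, tail = row[:tau_hat], row[tau_hat:]
        m_head = sum(head) / len(head)
        m_tail = sum(tail) / len(tail) if tail else 0.0  # no tail if tau_hat = T
        out.append([y - m_head for y in head] + [y - m_tail for y in tail])
    return out

def autocorrelations(E):
    """Empirical autocorrelations rho_hat_0, ..., rho_hat_{T-1} of residuals E."""
    N, T = len(E), len(E[0])
    sigma2 = sum(e * e for row in E for e in row) / (N * T)
    return [sum(row[s] * row[s + t] for row in E for s in range(T - t))
            / (sigma2 * N * T) for t in range(T)]

def r_hat(t, rho, h):
    """Kernel estimate of the cumulative autocorrelation function r(t)."""
    return sum((t - abs(s)) * parzen(s / h) * rho[abs(s)]
               for s in range(-t + 1, t))

def R_hat(t, v, rho, h):
    """Kernel estimate of the shifted cumulative correlation R(t, v), t < v."""
    return sum(parzen((u - s) / h) * rho[u - s]
               for s in range(1, t + 1) for u in range(t + 1, v + 1))

# Small deterministic illustration with N = 2 panels and T = 6.
E = residuals([[1.0, 2.0, 0.0, 3.0, 1.0, 4.0],
               [0.0, 1.0, 1.0, 2.0, 5.0, 3.0]], 4)
rho = autocorrelations(E)
lambda_12 = r_hat(1, rho, 2.0) + R_hat(1, 2, rho, 2.0)  # entry (1, 2) of Lambda
```

The estimated covariance matrix from Theorem 1 is then filled in entrywise as $\hat{r}(t)$ on the diagonal and $\hat{r}(t) + \hat{R}(t,v)$ for $t < v$.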
5 Bootstrap and hypothesis testing

A wide range of literature has been published on bootstrapping in the change
point problem, e.g., [5] or [?]. We build up the bootstrap test on the
resampling with replacement of row vectors
$\{[\hat{e}_{i,1}, \ldots, \hat{e}_{i,T}]\}_{i=1,\ldots,N}$ corresponding to
the panels. This provides bootstrapped row vectors
$\{[\hat{e}^*_{i,1}, \ldots, \hat{e}^*_{i,T}]\}_{i=1,\ldots,N}$.
Then, the bootstrapped residuals $\hat{e}^*_{i,t}$ are centered by their
conditional expectation $\frac{1}{N} \sum_{i=1}^{N} \hat{e}_{i,t}$, yielding

$$\hat{Y}^*_{i,t} := \hat{e}^*_{i,t} - \frac{1}{N} \sum_{i=1}^{N} \hat{e}_{i,t}.$$

The bootstrap test statistic is just a modification of the original statistic
$\mathcal{R}_N(T)$, where the original observations $Y_{i,t}$ are replaced by
their bootstrap counterparts $\hat{Y}^*_{i,t}$:


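$$\mathcal{R}^*_N(T) = \max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| \sum_{i=1}^{N} \sum_{r=1}^{s} \left( \hat{Y}^*_{i,r} - \bar{\hat{Y}}^*_{i,t} \right) \right|}{\max_{s=t,\ldots,T-1} \left| \sum_{i=1}^{N} \sum_{r=s+1}^{T} \left( \hat{Y}^*_{i,r} - \tilde{\hat{Y}}^*_{i,t} \right) \right|},$$

such that

$$\bar{\hat{Y}}^*_{i,t} = \frac{1}{t} \sum_{s=1}^{t} \hat{Y}^*_{i,s} \quad \text{and} \quad \tilde{\hat{Y}}^*_{i,t} = \frac{1}{T-t} \sum_{s=t+1}^{T} \hat{Y}^*_{i,s}.$$
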

An algorithm for the bootstrap is shown in Procedure 1 and its validity will
be proved in Theorem 4.


Procedure 1 Bootstrapping test statistic $\mathcal{R}_N(T)$.

Input: Panel data consisting of N panels with length T, i.e., N row vectors
of observations $[Y_{i,1}, \ldots, Y_{i,T}]$.

Output: Bootstrap distribution of $\mathcal{R}_N(T)$, i.e., the empirical
distribution where probability mass 1/B concentrates at each of
${}_{(1)}\mathcal{R}^*_N(T), \ldots, {}_{(B)}\mathcal{R}^*_N(T)$.

1: estimate the change point by calculating $\hat{\tau}_N$
2: compute residuals $\hat{e}_{i,t}$
3: for b = 1 to B do // repeat in order to obtain the empirical distribution
4:   generate $\{[\hat{e}^*_{i,1}, \ldots, \hat{e}^*_{i,T}]\}_{i=1,\ldots,N}$, resampled with replacement from the original rows $\{[\hat{e}_{i,1}, \ldots, \hat{e}_{i,T}]\}_{i=1,\ldots,N}$
5:   calculate bootstrap panel data $\hat{Y}^*_{i,t}$
6:   compute bootstrap test statistic ${}_{(b)}\mathcal{R}^*_N(T)$
7: end for
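
Procedure 1 translates almost line by line into code. The following Python sketch is meant as a hedged illustration only: the helper names are hypothetical, the weight $w(t) = t^2$ is an assumed default, and the residuals are computed by centering with the segment means around the estimated change point.

```python
import random

def ratio_statistic(Y):
    """Ratio type statistic computed from panel data Y (N rows of length T)."""
    N, T = len(Y), len(Y[0])
    P = [0.0]
    for r in range(T):
        P.append(P[-1] + sum(row[r] for row in Y))
    best = 0.0
    for t in range(2, T - 1):
        num = max(abs(P[s] - s / t * P[t]) for s in range(1, t + 1))
        den = max(abs((P[T] - P[s]) - (T - s) / (T - t) * (P[T] - P[t]))
                  for s in range(t, T))
        best = max(best, num / den)
    return best

def estimate_change_point(Y, w):
    """Step 1: weighted least squares estimate of the change point."""
    T = len(Y[0])
    def criterion(t):
        total = 0.0
        for row in Y:
            m = sum(row[:t]) / t
            total += sum((y - m) ** 2 for y in row[:t])
        return total / w(t)
    return min(range(2, T + 1), key=criterion)

def residuals(Y, tau_hat):
    """Step 2: observations centered by the means before/after tau_hat."""
    out = []
    for row in Y:
        head, tail = row[:tau_hat], row[tau_hat:]
        m_head = sum(head) / len(head)
        m_tail = sum(tail) / len(tail) if tail else 0.0
        out.append([y - m_head for y in head] + [y - m_tail for y in tail])
    return out

def bootstrap_statistics(Y, B, rng, w=lambda t: t ** 2):
    """Steps 3-7: B bootstrap replications of the ratio statistic."""
    N, T = len(Y), len(Y[0])
    E = residuals(Y, estimate_change_point(Y, w))
    # Conditional expectation of a resampled residual: column means of E.
    col_mean = [sum(row[t] for row in E) / N for t in range(T)]
    stats = []
    for _ in range(B):
        rows = [E[rng.randrange(N)] for _ in range(N)]  # whole rows, with replacement
        Y_star = [[e - m for e, m in zip(row, col_mean)] for row in rows]
        stats.append(ratio_statistic(Y_star))
    return stats

data_rng = random.Random(3)
Y = [[data_rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(30)]
boot = bootstrap_statistics(Y, B=20, rng=random.Random(4))
```

Resampling entire rows keeps the within-panel dependence intact, which is why no estimate of the correlation structure is needed at this stage.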
5.1 Validity of the resampling procedure


The idea behind bootstrapping is to mimic the original distribution of the
test statistic in some sense with the distribution of the bootstrap test
statistic, conditionally on the original data denoted by
$\mathbb{Y} = \{Y_{i,t}\}_{i,t=1}^{N,T}$.

First of all, two simple and just technical assumptions are needed. 

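Assumption B1 $\{\varepsilon_{i,t}\}_t$ possesses the lagged cumulative
correlation function

$$S(t,v,d) = \mathsf{Cov}\left( \sum_{s=1}^{t} \varepsilon_{i,s}, \sum_{u=t+d}^{v} \varepsilon_{i,u} \right) = \sum_{s=1}^{t} \sum_{u=t+d}^{v} \rho_{u-s}.$$

Assumption B2 $\displaystyle \lim_{N \to \infty} \mathsf{P}[\hat{\tau}_N = \tau] = 1$.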

Assumption B1 is not really an assumption; it is only a notation. Notice
that $S(t,v,1) = R(t,v)$. Assumption B2 is satisfied for our estimate
proposed in Section 4, if the assumptions of Theorem 3 hold. Assumption B2 is
postulated in a rather broader sense, because we want to allow any other
consistent estimate of $\tau$ to be used instead.

Recall that it is not known whether the common change in panel means
occurred or not. In other words, one does not know whether the data come
from the null or the alternative hypothesis. Therefore, the following theorem
holds under $H_0$ as well as $H_1$.


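Theorem 4 (Bootstrap justification) Under Assumptions A1, B1, B2, and C4

$$\mathcal{R}^*_N(T) \,|\, \mathbb{Y} \xrightarrow[N \to \infty]{\mathscr{D}} \max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| \mathcal{X}_s - \frac{s}{t} \mathcal{X}_t \right|}{\max_{s=t,\ldots,T-1} \left| \mathcal{Z}_s - \frac{T-s}{T-t} \mathcal{Z}_t \right|} \quad \text{in probability } \mathsf{P},$$

where $\mathcal{Z}_t := \mathcal{X}_T - \mathcal{X}_t$ and
$[\mathcal{X}_1, \ldots, \mathcal{X}_T]^\top$ is a multivariate normal random
vector with zero mean and covariance matrix
$\Gamma = \{\gamma_{t,v}\}_{t,v=1}^{T,T}$ such that

$$\gamma_{t,t} = \begin{cases} r(t) + \frac{t^2}{\tau^2} r(\tau) - \frac{2t}{\tau} \left[ r(t) + R(t,\tau) \right], & t < \tau; \\ 0, & t = \tau; \\ r(t-\tau) + \frac{(t-\tau)^2}{(T-\tau)^2} r(T-\tau) - \frac{2(t-\tau)}{T-\tau} \left[ r(t-\tau) + R(t-\tau, T-\tau) \right], & t > \tau \end{cases}$$

and

$$\gamma_{t,v} = \begin{cases} 0, & t = \tau \text{ or } v = \tau; \\ r(t) + R(t,v) + \frac{tv}{\tau^2} r(\tau) - \frac{v}{\tau} \left[ r(t) + R(t,\tau) \right] - \frac{t}{\tau} \left[ r(v) + R(v,\tau) \right], & t < v < \tau; \\ S(t,v,\tau+1-t) + \frac{t(v-\tau)}{\tau(T-\tau)} S(\tau,T,1) - \frac{v-\tau}{T-\tau} S(t,T,\tau+1-t) - \frac{t}{\tau} S(\tau,v,1), & t < \tau < v; \\ r(t-\tau) + R(t-\tau,v-\tau) + \frac{(t-\tau)(v-\tau)}{(T-\tau)^2} r(T-\tau) - \frac{v-\tau}{T-\tau} \left[ r(t-\tau) + R(t-\tau,T-\tau) \right] - \frac{t-\tau}{T-\tau} \left[ r(v-\tau) + R(v-\tau,T-\tau) \right], & \tau < t < v. \end{cases}$$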
The validity of the bootstrap test is assured by Theorem 4. Indeed, the
conditional asymptotic distribution of the bootstrap test statistic is
a functional of a multivariate normal distribution under the null as well as
under the alternative. It does not converge to infinity (in probability)
under the alternative. That is why it can be used for correctly rejecting the
null in favor of the alternative, given a sufficiently large N. Moreover, the
following theorem states that, under the null, the conditional asymptotic
distribution of the bootstrap test statistic and the unconditional
asymptotic distribution of the original test statistic coincide. That is the
reason why the bootstrap test should approximately keep the same level as the
original test based on the asymptotics from Theorem 1.

Theorem 5 (Bootstrap test consistency) Under Assumptions A1, B2, C4 and
hypothesis $H_0$, the asymptotic distribution of $\mathcal{R}_N(T)$ from
Theorem 1 and the asymptotic distribution of
$\mathcal{R}^*_N(T) \,|\, \mathbb{Y}$ from Theorem 4 coincide.

Now, the simulated (empirical) distribution of the bootstrap test statistic 
can be used to calculate the bootstrap critical value, which will be compared 
to the value of the original test statistic in order to reject the null or not. 
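
The decision step just described amounts to comparing the observed statistic with an upper empirical quantile of the bootstrap replications. A minimal sketch follows; the function name `bootstrap_decision` and the particular empirical quantile convention are assumptions made for illustration.

```python
def bootstrap_decision(stat, boot_stats, alpha=0.05):
    """Return (critical value, bootstrap p-value, reject?) for an observed statistic.

    The critical value is an empirical (1 - alpha) quantile of the bootstrap
    replications; the index convention used here is one of several in use.
    """
    ordered = sorted(boot_stats)
    k = min(len(ordered) - 1, int((1 - alpha) * len(ordered)))
    crit = ordered[k]
    p_value = sum(b >= stat for b in boot_stats) / len(boot_stats)
    return crit, p_value, stat > crit

# Illustration with artificial replications 1, 2, ..., 100.
crit, p_value, reject = bootstrap_decision(200.0, [float(b) for b in range(1, 101)])
```

Large values of the statistic speak against the null, so the test is one-sided: the null is rejected when the observed value exceeds the bootstrap critical value.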

Finally, note that one cannot think about any local alternative in this
setup, because $\tau$ has a discrete and finite support.

6 Simulations 

A simulation experiment was performed to study the finite sample properties
of the asymptotic and bootstrap test statistics for a common change in panel
means. In particular, the interest lies in the empirical sizes of the
proposed tests under the null hypothesis and in the empirical rejection rate
(power) under the alternative. Random samples of panel data (5000 each time)
are generated from the panel change point model (1). The panel size is set to
T = 10 and T = 25 in order to demonstrate the performance of the testing
approaches in case of small and intermediate panel length. The number of
panels considered is N = 50 and N = 200.

The correlation structure within each panel is modeled via random vectors
generated from iid, AR(1), and GARCH(1,1) sequences. The considered AR(1)
process has coefficient $\phi = 0.3$. In case of the GARCH(1,1) process, we
use coefficients $\alpha_0 = 1$, $\alpha_1 = 0.1$, and $\beta_1 = 0.2$, which
according to [12, Example 1] gives a strictly stationary process. In all
three sequences, the innovations are obtained as iid random variables from
a standard normal N(0,1) or Student $t_5$ distribution. Simulation scenarios
are produced as all possible combinations of the above mentioned settings.
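
The error sequences used in the simulation scenarios can be generated along the following lines. This is a sketch under stated assumptions, not the authors' simulation code: the function names, the burn-in length, and the recursion conventions are hypothetical, and the sequences are not rescaled to unit variance, which does not affect the scale-invariant ratio statistic.

```python
import math
import random

def innovation(rng, dist="normal"):
    """One iid innovation: N(0, 1), or Student t with 5 degrees of freedom."""
    z = rng.gauss(0.0, 1.0)
    if dist == "normal":
        return z
    chi2 = sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(5))
    return z / math.sqrt(chi2 / 5.0)  # t_5 as a normal over a scaled chi-square

def ar1_errors(T, rng, phi=0.3, dist="normal", burn_in=200):
    """AR(1) sequence e_t = phi * e_{t-1} + innovation, after a burn-in period."""
    e, out = 0.0, []
    for k in range(burn_in + T):
        e = phi * e + innovation(rng, dist)
        if k >= burn_in:
            out.append(e)
    return out

def garch11_errors(T, rng, a0=1.0, a1=0.1, b1=0.2, dist="normal", burn_in=200):
    """GARCH(1,1): e_t = sqrt(h_t) * innovation, h_t = a0 + a1 e_{t-1}^2 + b1 h_{t-1}."""
    h, e, out = a0 / (1.0 - a1 - b1), 0.0, []
    for k in range(burn_in + T):
        h = a0 + a1 * e ** 2 + b1 * h
        e = math.sqrt(h) * innovation(rng, dist)
        if k >= burn_in:
            out.append(e)
    return out

rng = random.Random(11)
panel_errors = [garch11_errors(10, rng, dist="t5") for _ in range(50)]
```

Each panel receives its own independently generated sequence, so that the panels are independent while the observations within a panel are dependent.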

When using the asymptotic distribution from Theorem 1, the covariance matrix
is estimated as proposed in Subsection 4.1 using the Parzen kernel. Several
values of the smoothing window width h are tried from the interval [2, 5] and
all of them work fine, providing comparable results. To simulate the
asymptotic distribution of the test statistic, 2000 multivariate random
vectors are generated using the pre-estimated covariance matrix.

The bootstrap approach does not need to estimate the covariance structure.
The number of bootstrap replications used is 2000. To assess the theoretical
results under $H_0$ numerically, Table 1 provides the empirical specificity
(one minus size) of the tests for both the asymptotic and bootstrap version
of the panel change point test, where the significance level is
$\alpha = 5\%$.


Table 1 Empirical specificity (1 - size) of the test under $H_0$ using the
asymptotic and the bootstrap critical values, considering a significance
level of 5%, weight function $w(t) = $ […] and smoothing window width h = 2.
Each cell lists the two reported values.

T    N    innovations   IID          AR(1)        GARCH(1,1)
10   50   N(0,1)        .942 .959    .932 .962    .952 .968
          t_5           .950 .967    .933 .962    .947 .966
     200  N(0,1)        .950 .964    .938 .968    .950 .968
          t_5           .950 .964    .934 .964    .941 .963
25   50   N(0,1)        .945 .961    .933 .965    .947 .963
          t_5           .949 .964    .929 .964    .947 .963
     200  N(0,1)        .951 .962    .928 .964    .953 .968
          t_5           .954 .965    .931 .966    .953 .967



It may be seen that both approaches (using asymptotic and bootstrap 
distribution) are close to the theoretical value of specificity .95. As expected, 
the best results are achieved in case of independence within the panel, because 
there is no information overlap between two consecutive observations. The 
precision of not rejecting the null is increasing as the number of panels is 
getting higher and the panel is getting longer as well. 

The performance of both testing procedures under $H_1$ in terms of the
empirical rejection rates is shown in Table 2, where the change point is set
to $\tau = \lfloor T/2 \rfloor$ and the change sizes $\delta_i$ are
independently uniform on [1, 3] in 33%, 66% or in all panels.

One can conclude that the power of both tests increases as the panel size
and the number of panels increase, which is straightforward and expected. It
should be noticed that numerical instability issues may appear for larger T
when generating from a T-variate normal distribution. Moreover, higher power
is obtained when a larger portion of panels is subject to a change in mean.
The test power drops when switching from independent observations within
the panel to dependent ones. Innovations with heavier tails (i.e., $t_5$)
yield smaller power than innovations with lighter tails. Generally, the
bootstrap outperforms the classical asymptotics in all scenarios.
Table 2 Empirical sensitivity (power) of the test under $H_1$ using the
asymptotic and the bootstrap critical values, considering a significance
level of 5%, weight function $w(t) = $ […] and smoothing window width h = 2.
Each cell lists the two reported values.

H_1    T    N    innovations   IID          AR(1)        GARCH(1,1)
33%    10   50   N(0,1)        .23 .06      .26 .07      .19 .05
                 t_5           .18 .05      .20 .06      .20 .05
            200  N(0,1)        .45 .05      .48 .05      .39 .05
                 t_5           .36 .05      .39 .05      .39 .05
       25   50   N(0,1)        .38 .05      .39 .05      .31 .05
                 t_5           .30 .05      .30 .05      .31 .06
            200  N(0,1)        .68 .05      .70 .05      .58 .05
                 t_5           .56 .05      .57 .05      .59 .05
66%    10   50   N(0,1)        .45 .39      .49 .46      .38 .10
                 t_5           .36 .10      .37 .14      .39 .15
            200  N(0,1)        .77 .59      .81 .93      .68 .05
                 t_5           .64 .05      .69 .12      .69 .10
       25   50   N(0,1)        .69 .08      .70 .11      .58 .05
                 t_5           .56 .05      .57 .06      .59 .06
            200  N(0,1)        .95 .06      .96 .05      .91 .05
                 t_5           .87 .05      .89 .05      .91 .05
100%   10   50   N(0,1)        .64 .92      .67 .84      .56 .58
                 t_5           .52 .37      .55 .45      .55 .55
            200  N(0,1)        .93 1.00     .95 1.00     .87 .77
                 t_5           .85 .36      .87 .74      .87 .72
       25   50   N(0,1)        .87 .84      .88 .85      .79 .11
                 t_5           .76 .08      .77 .11      .79 .20
            200  N(0,1)        1.00 .97     1.00 .97     .99 .05
                 t_5           .98 .05      .98 .05      .99 .07


Let us mention that for finite sections of processes with a stronger
dependence structure than taken into account in the simulation scenarios,
Assumption C2 does not have to be fulfilled. The dependency under the
considered variability can be too strong compared to the change size. Then,
it would be rather difficult to detect possible changes.

Finally, an early change point is discussed very briefly. We stay with
standard normal innovations, iid observations within the panel, the size of
changes $\delta_i$ being independently uniform on [1, 3] in all panels, and
the change point is $\tau = 3$ in case of T = 10 and $\tau = 5$ for T = 25.
The empirical sensitivities of both tests for small values of $\tau$ are
shown in Table 3.

When the change point is not in the middle of the panel, the power of the
test generally falls. The source of this decrease is that the left or right
part of the panel possesses fewer observations with constant mean, which
leads to a decrease of precision in the correlation estimation in case of
the asymptotic test and in the change point estimation in case of the
bootstrap test. Nevertheless, the bootstrap test again outperforms the
asymptotic version and, moreover, provides solid results even for early or
late change points (the late change points are not numerically demonstrated
here).

Table 3 Empirical sensitivity of the test for small values of $\tau$ under
$H_1$ using the asymptotic and the bootstrap critical values, considering
a significance level of 5%, weight function $w(t) = $ […] and smoothing
window width h = 2. Each cell lists the two reported values.

T    N    tau   H_1: iid, N(0,1)
10   50   3     .56 .08
     200  3     .87 .05
25   50   5     .63 .05
     200  5     .92 .05



7 Real data analysis 

As mentioned in the introduction, our primary motivation for testing the
panel mean change comes from the insurance business. The data set is provided
by the National Association of Insurance Commissioners (NAIC) database,
see [15]. We concentrate on the 'Commercial auto/truck liability/medical'
insurance line of business. The data collect records from N = 157 insurance
companies (one extreme insurance company was omitted from the analysis).
Each insurance company provides T = 10 yearly total claim amounts starting
from year 1988 up to year 1997. Figure 1 graphically shows series of claim
amounts for 20 selected insurance companies (a plot with all 157 panels would
be cluttered).

The data are considered as panel data in the way that each insurance company
corresponds to one panel, which is formed by the company's yearly total claim
amounts. The length of the panel is quite short. This is very typical in the
insurance business, because considering longer panels may invoke
incomparability between the early claim amounts and the late ones due to
changing market or policy conditions over time.

We want to test whether or not a change in the claim amounts occurred in
a common year, assuming that the claim amounts are approximately constant
in the years before and after the possible change for every insurance
company. Our ratio type test statistic gives $\mathcal{R}_{157}(10) = 39.9$.
The asymptotic critical value is 52.4 and the bootstrap critical value equals
203.1. Since the test statistic lies below both critical values, we do not
reject the hypothesis of no change in panel means in either case. The
striking difference between the two critical values may come from the
inefficient correlation structure estimation (since T = 10 is quite short) or
from violation of the model assumptions.
Fig. 1 Development of yearly total claim amounts for 20 selected insurance companies


That is why we also try to take the logarithms of claim amounts and to
consider log amounts as the panel data observations. Nevertheless, we again
do not reject the hypothesis of no change in the panel means (i.e., means of
log amounts). In addition, one can consider normalizing the claim amounts by
the premium received by company i in year t, that is, thinking of panel data
$Y_{i,t}/p_{i,t}$, where $p_{i,t}$ is the mentioned premium. This may yield
a stabilization of the series' variability, which corresponds to the
assumption of a common variance. In spite of that, we again do not reject the
null (neither by the asymptotic test, nor by the bootstrap one). For the sake
of completeness, we may reveal that our estimate of the panel change point
provides the value $\hat{\tau}_N = 10$, meaning no change in panels.


8 Conclusions 

In this paper, we consider the change point problem in panel data with fixed
panel size. Occurrence of common breaks in panel means is tested. We
introduce a ratio type test statistic and derive its asymptotic properties.
Under the null hypothesis of no change, the test statistic weakly converges
to a functional of the multivariate normal random vector with zero mean and
covariance structure depending on the intra-panel covariances. As shown in
the paper, these covariances can be estimated and, consequently, used for
testing whether a change in means occurred or not. This is indeed feasible,
because the test statistic under the alternative converges to infinity in
probability.

The secondary aim of the paper lies in proposing a consistent change point
estimate, which is straightforwardly used for bootstrapping the test
statistic. We establish the asymptotic behavior of the bootstrap version of
the test statistic, regardless of whether the data come from the null or the
alternative hypothesis. Moreover, the asymptotic distribution of the
bootstrap test statistic coincides with the original test statistic's
limiting distribution. This provides justification for the bootstrap method.
One of the main goals is to obtain a completely data driven approach for
testing whether the means remain the same during the observation period or
not. The ratio type test statistic allows us to omit a variance estimation
and the bootstrap technique overcomes estimation of the correlation
structure. Hence, neither nuisance nor smoothing parameters are present in
the whole testing process, which makes it very simple for practical use.
Furthermore, the whole stochastic theory behind it requires relatively simple
assumptions, which are not too restrictive.

A simulation study illustrates that even for small panel size, both presented 
approaches—based on traditional asymptotics and on bootstrapping—work 
fine. One may judge that both methods keep the significance level under the 
null, while various simulation scenarios are considered. Besides that, the power 
of the test is slightly higher in case of the bootstrap. Finally, the proposed 
methods are applied to insurance data, for which the change point analysis in 
panel data provides an appealing approach. 


8.1 Discussion 

First of all, it has to be noted that the non-ratio CUSUM type test statistic
can be used instead of the ratio type, but this requires estimating the
variance of the observations. The statements of the theorems and the proofs
would then become even less complicated. Dispensing with the bootstrap test
statistic can be unreliable, especially in short panels, from
a computational point of view, because the bootstrap overcomes the issue of
estimating the correlation structure.

Furthermore, our setup can be modified by considering large panel size, i.e., 
$T \to \infty$. Consequently, the whole theory leads to convergence to functionals 
of Gaussian processes with a covariance structure derived in a very similar 
fashion as for fixed $T$. However, our motivation is to develop techniques for 
fixed and relatively small panel size. 

Dependent panels may be taken into account and the presented work might 
be generalized to some kind of asymptotic independence over the panels or 
prescribed dependence among the panels. Nevertheless, our incentive is 
determined by a problem from non-life insurance, where the association of 
insurance companies consists of a relatively high number of members. Thus, 
the portfolio of yearly claims is so diversified that the panels corresponding to 
insurance companies’ yearly claims may be viewed as independent and neither 
natural ordering nor clustering has to be assumed. 


A Supporting Theorems 

Suppose that $\{\xi_n\}_{n=1}^{\infty}$ is a sequence of random variables/vectors existing on a probability 
space $(\Omega, \mathcal{F}, \mathsf{P})$. A bootstrap version of $\xi = [\xi_1, \ldots, \xi_n]^{\top}$ is its (randomly) resampled 
sequence with replacement, denoted by $\xi^* = [\xi_1^*, \ldots, \xi_n^*]^{\top}$, with the same length, where 
for each $i \in \{1, \ldots, n\}$ it holds that $\mathsf{P}^*_{\xi}[\xi_i^* = \xi_j] \equiv \mathsf{P}[\xi_i^* = \xi_j \mid \xi] = 1/n$, $j = 1, \ldots, n$. In 
the sequel, $\mathsf{P}^*_{\xi}$ denotes the conditional probability given $\xi$. So, $\xi_i^*$ has a discrete uniform 
distribution on $\{\xi_1, \ldots, \xi_n\}$ for every $i = 1, \ldots, n$. The conditional expectation and variance 
given $\xi$ are denoted by $\mathsf{E}_{\mathsf{P}^*_{\xi}}$ and $\operatorname{Var}_{\mathsf{P}^*_{\xi}}$. 
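
As a minimal illustration of the resampling scheme just described, the following Python sketch draws a bootstrap version of a finite sequence and evaluates the conditional mean and variance of a single bootstrap draw in closed form. The function names `bootstrap_resample` and `conditional_moments` are hypothetical and are introduced only for this example.

```python
import random


def bootstrap_resample(xi, rng):
    """Return a bootstrap version of xi: len(xi) uniform draws from xi with replacement."""
    return rng.choices(xi, k=len(xi))


def conditional_moments(xi):
    """Conditional mean and variance of one bootstrap draw, given the observed xi.

    Each draw is uniform on the observed values, so the conditional mean is the
    sample mean and the conditional variance is the 1/n-normalised sample variance.
    """
    n = len(xi)
    mean = sum(xi) / n
    var = sum(x * x for x in xi) / n - mean * mean
    return mean, var


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, only to make the demo reproducible
    xi = [1.0, 2.0, 3.0, 4.0]
    print(bootstrap_resample(xi, rng))
    print(conditional_moments(xi))
```

The closed-form moments avoid any Monte Carlo error: they depend on the observed values only, which is what conditioning on $\xi$ means here.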

If a statistic has an approximate normal distribution, one may be interested in the 
asymptotic comparison of the bootstrap distribution with the original one. A tool for 
assessing such an approximate closeness can be a bootstrap central limit theorem for triangular 
arrays. 


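Theorem A.1 (Bootstrap CLT for triangular arrays) Let $\{\xi_{n,k_n}\}_{n=1}^{\infty}$ be a triangular 
array of zero mean random variables on the same probability space such that the elements 
of the vector $\xi_n = [\xi_{n,1}, \ldots, \xi_{n,k_n}]^{\top}$ are iid for every $n \in \mathbb{N}$, satisfying 

$$\sup_{n \in \mathbb{N}} \mathsf{E}_{\mathsf{P}}\, \xi_{n,1}^4 < \infty \tag{5}$$

and $k_n \to \infty$ as $n \to \infty$. Suppose that $\xi_n^* = [\xi_{n,1}^*, \ldots, \xi_{n,k_n}^*]^{\top}$ is the bootstrapped version of 
$\xi_n = [\xi_{n,1}, \ldots, \xi_{n,k_n}]^{\top}$ and denote 

$$\bar{\xi}_n := k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i}, \qquad \bar{\xi}_n^* := k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i}^*, \qquad \varsigma_n^2 := \operatorname{Var}_{\mathsf{P}} \xi_{n,1}.$$

If 

$$\liminf_{n \to \infty} \varsigma_n^2 = \varsigma^2 > 0, \tag{6}$$

then 

$$\mathsf{P}^*_{\xi}\left[ \frac{\sqrt{k_n}}{\sqrt{\varsigma_n^2}} \left( \bar{\xi}_n^* - \bar{\xi}_n \right) \le x \right] - \mathsf{P}\left[ \frac{\sqrt{k_n}}{\sqrt{\varsigma_n^2}}\, \bar{\xi}_n \le x \right] \xrightarrow[n \to \infty]{\mathsf{P}} 0, \qquad \forall x \in \mathbb{R}.$$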
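
The statement of the bootstrap CLT can be checked numerically. The sketch below is only an illustration under assumptions that are not part of the paper: the data are iid uniform on $(-1, 1)$ (zero mean, variance $1/3$), the sample size is 400, and the conditional probability is approximated by 2000 bootstrap replicates. It compares the bootstrap probability at one point $x$ with the standard normal value that the unconditional probability approaches. The name `bootstrap_cdf_at` is hypothetical.

```python
import math
import random
from statistics import NormalDist


def bootstrap_cdf_at(xi, sigma, x, n_boot, rng):
    """Monte Carlo estimate of P*[ sqrt(k) * (mean(xi*) - mean(xi)) / sigma <= x ]."""
    k = len(xi)
    xbar = sum(xi) / k
    hits = 0
    for _ in range(n_boot):
        star = rng.choices(xi, k=k)  # resample with replacement
        stat = math.sqrt(k) * (sum(star) / k - xbar) / sigma
        hits += stat <= x
    return hits / n_boot


rng = random.Random(12345)  # fixed seed, assumption made for reproducibility
k = 400
xi = [rng.uniform(-1.0, 1.0) for _ in range(k)]  # zero mean, variance 1/3
sigma = math.sqrt(1.0 / 3.0)                      # true standard deviation
approx = bootstrap_cdf_at(xi, sigma, 0.5, 2000, rng)
target = NormalDist().cdf(0.5)                    # normal limit at x = 0.5
print(round(approx, 3), round(target, 3))
```

The two printed numbers should be close; the remaining gap mixes Monte Carlo error with the finite-sample difference between the two distributions.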


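Theorem A.2 (Bootstrap multivariate CLT for triangular arrays) Let $\{\xi_{n,k_n}\}_{n=1}^{\infty}$ 
be a triangular array of zero mean $q$-dimensional random vectors on the same probability 
space such that the elements of the vector sequence $\{\xi_{n,1}, \ldots, \xi_{n,k_n}\}$ are iid for every $n \in \mathbb{N}$, 
satisfying 

$$\sup_{n \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} \big| \xi_{n,1}^{(j)} \big|^4 < \infty, \qquad j \in \{1, \ldots, q\}, \tag{7}$$

where $\xi_{n,1} = [\xi_{n,1}^{(1)}, \ldots, \xi_{n,1}^{(q)}]^{\top}$, $n \in \mathbb{N}$, and $k_n \to \infty$ as $n \to \infty$. Assume that $\Xi_n^* = 
[\xi_{n,1}^*, \ldots, \xi_{n,k_n}^*]^{\top}$ is the bootstrapped version of $\Xi_n = [\xi_{n,1}, \ldots, \xi_{n,k_n}]^{\top}$. Denote 

$$\bar{\xi}_n := k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i}, \qquad \bar{\xi}_n^* := k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i}^*, \qquad \Gamma_n := \operatorname{Var}_{\mathsf{P}} \xi_{n,1}.$$

If 

$$\liminf_{n \to \infty} \Gamma_n = \Gamma > 0, \tag{8}$$

then 

$$\mathsf{P}^*_{\xi}\left[ \sqrt{k_n}\, \Gamma_n^{-1/2} \left( \bar{\xi}_n^* - \bar{\xi}_n \right) \le x \right] - \mathsf{P}\left[ \sqrt{k_n}\, \Gamma_n^{-1/2}\, \bar{\xi}_n \le x \right] \xrightarrow[n \to \infty]{\mathsf{P}} 0, \qquad \forall x \in \mathbb{R}^q.$$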
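
In the multivariate setting the resampled objects are whole vectors, so the dependence between the components of a vector is carried over to the bootstrap sample without being estimated. A minimal sketch of this row-wise resampling follows; the helper names `bootstrap_rows` and `mean_vector` are hypothetical and the three example vectors are made up.

```python
import random


def bootstrap_rows(rows, rng):
    """Resample whole q-dimensional vectors with replacement, keeping each vector intact."""
    return [rows[rng.randrange(len(rows))] for _ in rows]


def mean_vector(rows):
    """Componentwise average of a list of equal-length vectors."""
    k = len(rows)
    return [sum(col) / k for col in zip(*rows)]


rows = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]   # made-up data, one vector per row
star = bootstrap_rows(rows, random.Random(7))
print(mean_vector(rows), mean_vector(star))
```

Resampling components independently instead of whole rows would destroy the within-vector dependence, which is why the rows are never split.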


B Proofs 

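Proof (of Theorem 1) Let us define 

$$U_N(t) := \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \sum_{s=1}^{t} (Y_{i,s} - \mu_i).$$

Using the multivariate Lindeberg-Lévy CLT for a sequence of $T$-dimensional iid random 
vectors $\{[\sum_{s=1}^{1} \varepsilon_{i,s}, \ldots, \sum_{s=1}^{T} \varepsilon_{i,s}]^{\top}\}_{i \in \mathbb{N}}$, we have under $H_0$ 

$$[U_N(1), \ldots, U_N(T)]^{\top} \xrightarrow[N \to \infty]{\mathscr{D}} [X_1, \ldots, X_T]^{\top},$$

since $\operatorname{Var} [\sum_{s=1}^{1} \varepsilon_{1,s}, \ldots, \sum_{s=1}^{T} \varepsilon_{1,s}]^{\top} = \Lambda$. Indeed, the $t$-th diagonal element of the 
covariance matrix $\Lambda$ is 

$$\operatorname{Var} \sum_{s=1}^{t} \varepsilon_{1,s} = r(t)$$

and the upper off-diagonal element on position $(t, v)$ is 

$$\operatorname{Cov}\left( \sum_{s=1}^{t} \varepsilon_{1,s}, \sum_{u=1}^{v} \varepsilon_{1,u} \right) = \operatorname{Var} \sum_{s=1}^{t} \varepsilon_{1,s} + \operatorname{Cov}\left( \sum_{s=1}^{t} \varepsilon_{1,s}, \sum_{u=t+1}^{v} \varepsilon_{1,u} \right) = r(t) + R(t, v), \qquad t < v.$$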
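
The decomposition of the entries of the covariance matrix of the partial sums into $r(t)$ and $R(t, v)$ can be verified numerically. The sketch below assumes a hypothetical error covariance with unit variances and correlation $0.5^{|s-u|}$; this particular structure is an assumption made for the example, not one taken from the paper, and the function names are likewise hypothetical.

```python
def partial_sum_covariance(C):
    """Covariance matrix of the partial sums S_t = e_1 + ... + e_t.

    C[s][u] is the covariance of e_{s+1} and e_{u+1} (0-based indices), so the
    entry on position (t, v) is the sum of C[s][u] over s <= t and u <= v.
    """
    T = len(C)
    return [[sum(C[s][u] for s in range(t + 1) for u in range(v + 1))
             for v in range(T)] for t in range(T)]


def r(C, t):
    """r(t) = Var(e_1 + ... + e_t), with t counted from 1."""
    return sum(C[s][u] for s in range(t) for u in range(t))


def R(C, t, v):
    """R(t, v) = Cov(e_1 + ... + e_t, e_{t+1} + ... + e_v) for t < v, counted from 1."""
    return sum(C[s][u] for s in range(t) for u in range(t, v))


# Hypothetical error covariance: unit variances, correlation 0.5 ** |s - u|.
T = 4
C = [[0.5 ** abs(s - u) for u in range(T)] for s in range(T)]
Lam = partial_sum_covariance(C)
for t in range(1, T + 1):
    for v in range(t + 1, T + 1):
        assert Lam[t - 1][v - 1] == r(C, t) + R(C, t, v)
print(Lam[0][1])  # equals r(1) + R(1, 2)
```

For uncorrelated errors with unit variance the same function returns the matrix with entries $\min(t, v)$, the covariance of a random walk.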

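Moreover, let us define the reverse analogue to $U_N(t)$, i.e., 

$$V_N(t) := \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \sum_{s=t+1}^{T} (Y_{i,s} - \mu_i) = U_N(T) - U_N(t).$$

Hence, 

$$U_N(s) - \frac{s}{t} U_N(t) = \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \left\{ \sum_{r=1}^{s} \left[ (Y_{i,r} - \mu_i) - \frac{1}{t} \sum_{v=1}^{t} (Y_{i,v} - \mu_i) \right] \right\} = \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \sum_{r=1}^{s} \left( Y_{i,r} - \bar{Y}_{i,t} \right)$$

and, consequently, 

$$V_N(s) - \frac{T-s}{T-t} V_N(t) = \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \left\{ \sum_{r=s+1}^{T} \left[ (Y_{i,r} - \mu_i) - \frac{1}{T-t} \sum_{v=t+1}^{T} (Y_{i,v} - \mu_i) \right] \right\} = \frac{1}{\sigma \sqrt{N}} \sum_{i=1}^{N} \sum_{r=s+1}^{T} \left( Y_{i,r} - \widetilde{Y}_{i,t} \right),$$

where $\bar{Y}_{i,t}$ and $\widetilde{Y}_{i,t}$ denote the averages of the first $t$ and of the last $T - t$ observations in panel $i$. 
Using the Cramér-Wold device, we end up with 

$$\max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| U_N(s) - \frac{s}{t} U_N(t) \right|}{\max_{s=t,\ldots,T-1} \left| V_N(s) - \frac{T-s}{T-t} V_N(t) \right|} \xrightarrow[N \to \infty]{\mathscr{D}} \max_{t=2,\ldots,T-2} \frac{\max_{s=1,\ldots,t} \left| X_s - \frac{s}{t} X_t \right|}{\max_{s=t,\ldots,T-1} \left| (X_T - X_s) - \frac{T-s}{T-t} (X_T - X_t) \right|}.$$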
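
The ratio appearing on the left-hand side of the last convergence can be computed directly from the data, because the common factor $1/(\sigma \sqrt{N})$ cancels between numerator and denominator. The following Python sketch evaluates it for panels stored as a list of $N$ lists of common length $T \ge 4$. The function name `ratio_statistic` and the toy data are assumptions made for this illustration.

```python
import math


def ratio_statistic(panels):
    """Ratio type statistic for N panels of common length T (a list of N lists).

    For each t = 2, ..., T-2 the numerator is the largest absolute cumulative sum
    of deviations from the mean of the first t observations, the denominator the
    largest absolute reverse cumulative sum of deviations from the mean of the
    last T - t observations; deviations are summed over panels before taking |.|.
    """
    T = len(panels[0])
    best = 0.0
    for t in range(2, T - 1):                              # t = 2, ..., T-2
        head = [sum(p[:t]) / t for p in panels]            # means of the first t obs
        tail = [sum(p[t:]) / (T - t) for p in panels]      # means of the last T-t obs
        num = max(abs(sum(sum(p[:s]) - s * m for p, m in zip(panels, head)))
                  for s in range(1, t + 1))                # s = 1, ..., t
        den = max(abs(sum(sum(p[s:]) - (T - s) * m for p, m in zip(panels, tail)))
                  for s in range(t, T))                    # s = t, ..., T-1
        ratio = num / den if den > 0 else math.inf
        best = max(best, ratio)
    return best


print(ratio_statistic([[0, 1, 0, 2]]))   # single toy panel with T = 4
```

Adding a panel-specific constant to every observation of a panel leaves the value unchanged, since only deviations from within-panel averages enter.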
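Proof (of Theorem 2) Let $t = \tau + 1$. Then, under the alternative $H_1$, 

$$\frac{1}{\sigma \sqrt{N}} \max_{s=1,\ldots,\tau+1} \left| \sum_{i=1}^{N} \sum_{r=1}^{s} \left( Y_{i,r} - \bar{Y}_{i,\tau+1} \right) \right| \ge \frac{1}{\sigma \sqrt{N}} \left| \sum_{i=1}^{N} \left( Y_{i,1} - \bar{Y}_{i,\tau+1} \right) \right|$$

$$= \left| \frac{1}{\sqrt{N}} \sum_{i=1}^{N} \left( \varepsilon_{i,1} - \bar{\varepsilon}_{i,\tau+1} \right) - \frac{1}{\sigma (\tau+1) \sqrt{N}} \sum_{i=1}^{N} \delta_i \right| = \left| O_{\mathsf{P}}(1) - \frac{1}{\sigma (\tau+1) \sqrt{N}} \sum_{i=1}^{N} \delta_i \right| \xrightarrow{\mathsf{P}} \infty, \qquad N \to \infty,$$

where $\bar{\varepsilon}_{i,\tau+1} = \frac{1}{\tau+1} \sum_{v=1}^{\tau+1} \varepsilon_{i,v}$. 

Since there is no change after $\tau + 1$ and $\tau \le T - 3$, then by Theorem 1 we have 

$$\frac{1}{\sigma \sqrt{N}} \max_{s=\tau+1,\ldots,T-1} \left| \sum_{i=1}^{N} \sum_{r=s+1}^{T} \left( Y_{i,r} - \widetilde{Y}_{i,\tau+1} \right) \right| \xrightarrow[N \to \infty]{\mathscr{D}} \max_{s=\tau+1,\ldots,T-1} \left| (X_T - X_s) - \frac{T-s}{T-\tau-1} (X_T - X_{\tau+1}) \right|.$$
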

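Proof (of Theorem 3) Let us define 

$$S_N^{(i)}(t) := \frac{1}{w(t)} \sum_{s=1}^{t} \left( Y_{i,s} - \bar{Y}_{i,t} \right)^2$$

and, consequently, $S_N(t) := \frac{1}{N} \sum_{i=1}^{N} S_N^{(i)}(t)$. Then, 

$$S_N^{(i)}(t) = \begin{cases} \dfrac{1}{w(t)} \displaystyle\sum_{s=1}^{t} \sigma^2 \left( \varepsilon_{i,s} - \bar{\varepsilon}_{i,t} \right)^2, & t \le \tau, \\[2ex] \dfrac{1}{w(t)} \left\{ \displaystyle\sum_{s=1}^{\tau} \left[ \sigma \left( \varepsilon_{i,s} - \bar{\varepsilon}_{i,t} \right) - \frac{t-\tau}{t} \delta_i \right]^2 + \displaystyle\sum_{s=\tau+1}^{t} \left[ \sigma \left( \varepsilon_{i,s} - \bar{\varepsilon}_{i,t} \right) + \frac{\tau}{t} \delta_i \right]^2 \right\}, & t > \tau, \end{cases}$$

where $\bar{\varepsilon}_{i,t} = \frac{1}{t} \sum_{s=1}^{t} \varepsilon_{i,s}$. By the definition of the cumulative autocorrelation function, we 
have for $2 \le t \le \tau$ 

$$\mathsf{E} S_N^{(i)}(t) = \frac{\sigma^2}{w(t)} \sum_{s=1}^{t} \left[ 1 - \frac{2}{t} \sum_{r=1}^{t} \mathsf{E}\, \varepsilon_{i,s} \varepsilon_{i,r} + \frac{1}{t^2} r(t) \right] = \frac{\sigma^2 t}{w(t)} \left( 1 - \frac{r(t)}{t^2} \right).$$

In the other case when $t > \tau$, one can calculate 

$$\mathsf{E} S_N^{(i)}(t) = \frac{\sigma^2 t}{w(t)} \left( 1 - \frac{r(t)}{t^2} \right) + \frac{\tau (t - \tau)}{t\, w(t)}\, \delta_i^2.$$
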


Realize that $S_N^{(i)}(t) - \mathsf{E} S_N^{(i)}(t)$ are independent with zero mean for fixed $t$ and $i = 1, \ldots, N$. 
Due to Assumption C4, for $2 \le t \le \tau$ it holds 


1 ^ 

VarSiv(t) = ^ X 


+1 w’^(i) 


Var 


X i^i,s - £i,t) 


- 
where $C_1(t, \sigma) > 0$ is some constant not depending on $N$. If $t > \tau$, then 
N 


Var 5iv(t) = ^ ^ 1 . Var 

.■^1 w ^{ t ) 


^ _ j. 

XI “ 2—j—a5i X iei,s - £i.t) 

S=1 S=1 

+ (V) 

+ 2-a5i X i^i.s - £i,t) + [-) &i 


1 1 

— C2{t,T,a) + -J^C3(t,T,(T) X + j^Ci{t,T,a) X 


where $C_j(t, \tau, \sigma) > 0$ does not depend on $N$ for $j = 2, 3, 4$. 

The Chebyshev inequality provides $S_N(t) - \mathsf{E} S_N(t) = O_{\mathsf{P}}\big( \sqrt{\operatorname{Var} S_N(t)} \big)$ as $N \to \infty$. 
According to Assumption C3 and the Cauchy-Schwarz inequality, we have 


1 iv iv 

4 y ^ - y (52 ^ 0, TV 

y2 Z_J * ^ /V\ /V ^ ^ ’ 


iv\ iv y 


Since the index set $\{2, \ldots, T\}$ is finite and $\tau$ is finite as well, then 

1 1 ^ 1 
Var Sjv(i) ^ X 


X-Si 




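where the four constants on the right-hand side, indexed by $j = 1, 2, 3, 4$ and depending only on $\sigma$, are 
positive and do not depend on $N$. Thus, we also have uniform stochastic boundedness, i.e., 

$$\max_{2 \le t \le T} \left| S_N(t) - \mathsf{E} S_N(t) \right| = O_{\mathsf{P}}\left( \frac{1}{\sqrt{N}} \right), \qquad N \to \infty.$$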

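Adding and subtracting, one has 

$$\begin{aligned} S_N(t) - S_N(\tau) &= S_N(t) - \mathsf{E} S_N(t) - \left[ S_N(\tau) - \mathsf{E} S_N(\tau) \right] + \mathsf{E} S_N(t) - \mathsf{E} S_N(\tau) \\ &\ge -2 \max_{2 \le r \le T} \left| S_N(r) - \mathsf{E} S_N(r) \right| + \mathsf{E} S_N(t) - \mathsf{E} S_N(\tau) \\ &= -2 \max_{2 \le r \le T} \left| S_N(r) - \mathsf{E} S_N(r) \right| + \sigma^2 \left[ \frac{t}{w(t)} \left( 1 - \frac{r(t)}{t^2} \right) - \frac{\tau}{w(\tau)} \left( 1 - \frac{r(\tau)}{\tau^2} \right) \right] \\ &\quad + I\{t > \tau\} \frac{\tau (t - \tau)}{t\, w(t)} \frac{1}{N} \sum_{i=1}^{N} \delta_i^2. \end{aligned}$$
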


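The above inequality holds for each $t \in \{2, \ldots, T\}$ and, particularly, it holds for $\hat{\tau}_N$. Note 
that $\hat{\tau}_N = \arg\min_t S_N(t)$. Hence, $S_N(\hat{\tau}_N) - S_N(\tau) \le 0$. Therefore, 

$$\begin{aligned} 2 \sqrt{N} \max_{2 \le r \le T} \left| S_N(r) - \mathsf{E} S_N(r) \right| &\ge \sqrt{N} \sigma^2 \left[ \frac{\hat{\tau}_N}{w(\hat{\tau}_N)} \left( 1 - \frac{r(\hat{\tau}_N)}{\hat{\tau}_N^2} \right) - \frac{\tau}{w(\tau)} \left( 1 - \frac{r(\tau)}{\tau^2} \right) \right] \\ &\quad + I\{\hat{\tau}_N > \tau\} \frac{\tau (\hat{\tau}_N - \tau)}{\hat{\tau}_N\, w(\hat{\tau}_N)} \frac{1}{\sqrt{N}} \sum_{i=1}^{N} \delta_i^2. \end{aligned} \tag{9}$$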
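If $\hat{\tau}_N > \tau$ almost surely for infinitely many $N$, then the left hand side of (9) is $O_{\mathsf{P}}(1)$ as 
$N \to \infty$, but the right hand side is unbounded by assumption. If $\hat{\tau}_N < \tau$ almost 
surely for infinitely many $N$, then due to the monotonicity Assumption C1 

$$0 \xleftarrow[N \to \infty]{\mathsf{P}} 2 \max_{2 \le r \le T} \left| S_N(r) - \mathsf{E} S_N(r) \right| \ge \sigma^2 \left[ \frac{\hat{\tau}_N}{w(\hat{\tau}_N)} \left( 1 - \frac{r(\hat{\tau}_N)}{\hat{\tau}_N^2} \right) - \frac{\tau}{w(\tau)} \left( 1 - \frac{r(\tau)}{\tau^2} \right) \right] > 0,$$

which is a contradicting conclusion. Hence, $\mathsf{P}[\hat{\tau}_N = \tau] \to 1$ as $N \to \infty$. 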

Proof (of Theorem\^ Let us define Ss^i ^i,s, cr~^ Ss-i ^i,s 

N t T N 




and 




. N t . N t / N 


1 S=1 
N t 






Realize that $\hat{e}_{i,t}$ depends on $\hat{\tau}_N$ and, hence, it depends on $N$. Thus, $\hat{e}_{i,t} \equiv \hat{e}_{i,t}(N)$. Since 
Assumption C4 holds, then according to the bootstrap multivariate CLT for triangular arrays 
(Theorem A.2) of $T$-dimensional vectors $\xi_{N,i} = [\hat{e}_{i,1}(N), \ldots, \hat{e}_{i,T}(N)]^{\top}$ with $k_N = N$, we 
have 


P U*{T)]^ ==: x|y] - P , Un{T)]^ =£ x] 

—VxgR^, 
N—*co 


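where $\Gamma_N = \operatorname{Var} [\hat{e}_{1,1}, \ldots, \hat{e}_{1,T}]^{\top}$. 

Now, it is sufficient to realize that $[\widehat{U}_N(1), \ldots, \widehat{U}_N(T)]^{\top}$ has an approximate multivariate 
normal distribution with zero mean and covariance matrix $\Gamma = \lim_{N \to \infty} \Gamma_N$. Using the law 
of total variance, 

$$\begin{aligned} \operatorname{Var} \hat{e}_{i,t} &= \mathsf{E}\left[ \operatorname{Var}\{\hat{e}_{i,t} \mid \hat{\tau}_N\} \right] + \operatorname{Var}\left[ \mathsf{E}\{\hat{e}_{i,t} \mid \hat{\tau}_N\} \right] \\ &= \sum_{\pi=1}^{T} \mathsf{P}[\hat{\tau}_N = \pi] \operatorname{Var}[\hat{e}_{i,t} \mid \hat{\tau}_N = \pi] + \sum_{\pi=1}^{T} \mathsf{P}[\hat{\tau}_N = \pi] \left\{ \mathsf{E}[\hat{e}_{i,t} \mid \hat{\tau}_N = \pi] \right\}^2 \\ &\quad - \left\{ \sum_{\pi=1}^{T} \mathsf{P}[\hat{\tau}_N = \pi]\, \mathsf{E}[\hat{e}_{i,t} \mid \hat{\tau}_N = \pi] \right\}^2. \end{aligned}$$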
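
The law of total variance over a finite set of conditioning events is easy to evaluate numerically. The sketch below uses made-up probabilities and conditional moments, and the function name `total_variance` is hypothetical. It also shows the limiting situation used in the proof: when all probability mass sits on one event with zero conditional mean, the unconditional variance reduces to the conditional variance on that event.

```python
def total_variance(p, cond_mean, cond_var):
    """Unconditional variance from conditional moments over a finite partition.

    p[k] is the probability of the k-th conditioning event, cond_mean[k] and
    cond_var[k] are the conditional mean and variance given that event.
    """
    within = sum(pk * vk for pk, vk in zip(p, cond_var))        # E[Var(X | event)]
    second = sum(pk * mk * mk for pk, mk in zip(p, cond_mean))  # E[(E(X | event))^2]
    first = sum(pk * mk for pk, mk in zip(p, cond_mean))        # E[E(X | event)]
    return within + second - first * first


# Two equally likely events, conditional means 0 and 2, conditional variances 1.
print(total_variance([0.5, 0.5], [0.0, 2.0], [1.0, 1.0]))
# All mass on the first event, whose conditional mean is zero.
print(total_variance([1.0, 0.0], [0.0, 5.0], [3.0, 7.0]))
```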


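Since $\lim_{N \to \infty} \mathsf{P}[\hat{\tau}_N = \tau] = 1$ and $\mathsf{E}[\hat{e}_{i,t} \mid \hat{\tau}_N = \tau] = 0$, then 

$$\lim_{N \to \infty} \operatorname{Var} \hat{e}_{i,t} = \lim_{N \to \infty} \operatorname{Var}[\hat{e}_{i,t} \mid \hat{\tau}_N = \tau].$$

Similarly with the covariance, i.e., after applying the law of total covariance, we have 

$$\lim_{N \to \infty} \operatorname{Cov}(\hat{e}_{i,t}, \hat{e}_{i,v}) = \lim_{N \to \infty} \operatorname{Cov}(\hat{e}_{i,t}, \hat{e}_{i,v} \mid \hat{\tau}_N = \tau).$$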


Note that 




iei,t\rN ^ r) 


t ^ T', 

~ t > T] 
where 


1 * 


and £i^t = 


T-t 


X 


S = t+1 


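Taking into account the definitions of $r(t)$, $R(t, v)$, and $S(t, v, d)$ together with some simple 
algebra, we obtain that $\operatorname{Var}[\hat{e}_{1,t} \mid \hat{\tau}_N = \tau] = \gamma_{t,t}(\tau)$ and $\operatorname{Cov}(\hat{e}_{1,t}, \hat{e}_{1,v} \mid \hat{\tau}_N = \tau) = \gamma_{t,v}(\tau)$ for 
$t < v$, where the elements $\gamma_{t,t}(\tau)$ and $\gamma_{t,v}(\tau)$ are as in the statement of Theorem 4. 

Then the sum in the numerator of $\mathcal{R}_N^*(T)$ can be alternatively rewritten as 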


7^/N 


XX 


y* _ V* 
^ i.r ^ i.t 


T^/~N 


X 


X 




- f y y.* 

i La 



Concerning the denominator of $\mathcal{R}_N^*(T)$, one needs to perform a similar calculation as in 
the proof of Theorem 1, i.e., to define $\widehat{V}_N(t)$ and $\widehat{V}_N^*(t)$ analogously to $\widehat{U}_N(t)$ and 
$\widehat{U}_N^*(t)$ as $V_N(t)$ is to $U_N(t)$. Applying the Cramér-Wold theorem completes the proof. □ 

Proof (of Theorem Recall the notation from the proof of Theorem Under Ho. IB2I 
and ED it holds 

lim P[fiv ^ T] ^ 1. 
iv^oo 

Then in view of @1, 

lim P rUAr(s)-UAr(t) ^ UAr(s)-Uiv(t)l ^1, 


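Proof (of Theorem A.1) The Lyapunov condition [Sj p. 371] for a triangular array of random 
variables $\{\xi_{n,k_n}\}_{n=1}^{\infty}$ is satisfied due to (5) and (6), i.e., for $\omega = 2$: 

$$\frac{1}{\left( \sqrt{k_n \varsigma_n^2} \right)^{2+\omega}} \sum_{i=1}^{k_n} \mathsf{E}_{\mathsf{P}} |\xi_{n,i}|^{2+\omega} \le k_n^{-\omega/2}\, \varsigma_n^{-(2+\omega)} \sup_{\iota \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} |\xi_{\iota,1}|^{2+\omega} \to 0, \qquad n \to \infty.$$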
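
For iid summands the Lyapunov ratio has a closed form that makes its decay visible. The sketch below evaluates it for a hypothetical example that is not taken from the paper: variables uniform on $(-1, 1)$, for which the variance is $1/3$ and the fourth absolute moment is $1/5$. The function name `lyapunov_ratio` is an assumption of this illustration.

```python
import math


def lyapunov_ratio(k, abs_moment, variance, omega=2.0):
    """Lyapunov ratio for k iid zero mean summands.

    abs_moment is E|xi|^(2 + omega) and variance is Var(xi); the ratio equals
    k * abs_moment / (k * variance) ** ((2 + omega) / 2).
    """
    return k * abs_moment / (k * variance) ** ((2.0 + omega) / 2.0)


# Uniform(-1, 1): Var = 1/3 and E xi^4 = 1/5, so for omega = 2 the ratio is 1.8 / k.
for k in (100, 400, 1600):
    print(k, lyapunov_ratio(k, 0.2, 1.0 / 3.0))
print(math.isclose(lyapunov_ratio(100, 0.2, 1.0 / 3.0), 0.018))
```

With $\omega = 2$ the ratio decays like $1/k_n$, provided the fourth moment stays bounded and the variance stays bounded away from zero.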


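Therefore, the CLT for $\{\xi_{n,k_n}\}_{n=1}^{\infty}$ holds and 

$$\sup_{x \in \mathbb{R}} \left| \mathsf{P}\left[ \frac{\sqrt{k_n}}{\sqrt{\varsigma_n^2}}\, \bar{\xi}_n \le x \right] - \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi}} \exp\left\{ -\frac{t^2}{2} \right\} \mathrm{d}t \right| \xrightarrow[n \to \infty]{} 0.$$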


Now, to prove the theorem, it suffices to show the following three statements: 


(i) sup^gji P| 


y/kn 


p*^n,l 


-pj}: C 


^ X 


-z 


00 VaF 


exp 1 — y f dt 


( 11 ) 

(ill) E [P]-a.s. 


0 ; 


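Proving (iii) is trivial, because $\mathsf{E}_{\mathsf{P}^*_{\xi}} \bar{\xi}_n^* = \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* = \bar{\xi}_n$ $[\mathsf{P}]$-a.s. 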

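Let us calculate the conditional variance of the bootstrapped variable $\xi_{n,1}^*$: $\operatorname{Var}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* = 
\mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^{*2} - \big( \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \big)^2 = k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i}^2 - \big( k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i} \big)^2$, $[\mathsf{P}]$-a.s. The weak law of large 
numbers together with (5) provides 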


^71 fT' ^ 
and 


0 S il,i - pn ^ - Ep«n,l = Var 

i=l \ i=i J « 

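The last result of the WLLN is true, because (5) implies 

$$k_n^{-2} \sum_{i=1}^{k_n} \operatorname{Var}_{\mathsf{P}} \xi_{n,i}^2 \le k_n^{-1} \sup_{n \in \mathbb{N}} \mathsf{E}_{\mathsf{P}}\, \xi_{n,1}^4 \to 0.$$

Thus, (ii) is proved. 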

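The Berry-Esseen-Katz theorem (see [11]) with $g(x) = |x|^{\epsilon}$, $\epsilon > 0$, for the bootstrapped 
sequence of iid (with respect to $\mathsf{P}^*_{\xi}$) random variables $\{\xi_{n,i}^*\}_{i=1}^{k_n}$ results in 

$$\sup_{x \in \mathbb{R}} \left| \mathsf{P}^*_{\xi}\left[ \frac{\sqrt{k_n} \left( \bar{\xi}_n^* - \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \right)}{\sqrt{\operatorname{Var}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^*}} \le x \right] - \int_{-\infty}^{x} \frac{1}{\sqrt{2\pi}} \exp\left\{ -\frac{t^2}{2} \right\} \mathrm{d}t \right| \le C k_n^{-\epsilon/2}\, \frac{\mathsf{E}_{\mathsf{P}^*_{\xi}} \left| \xi_{n,1}^* - \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \right|^{2+\epsilon}}{\left( \operatorname{Var}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \right)^{(2+\epsilon)/2}} \quad [\mathsf{P}]\text{-a.s.}, \tag{10}$$

for all $n \in \mathbb{N}$, where $C > 0$ is an absolute constant. 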

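The Jensen inequality and Minkowski inequality provide an upper bound for the 
numerator from the right-hand side of (10): 

$$\begin{aligned} \mathsf{E}_{\mathsf{P}^*_{\xi}} \left| \xi_{n,1}^* - \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \right|^{2+\epsilon} &= k_n^{-1} \sum_{i=1}^{k_n} \left| \xi_{n,i} - k_n^{-1} \sum_{j=1}^{k_n} \xi_{n,j} \right|^{2+\epsilon} \\ &\le \left\{ \left( k_n^{-1} \sum_{i=1}^{k_n} |\xi_{n,i}|^{2+\epsilon} \right)^{1/(2+\epsilon)} + \left| k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i} \right| \right\}^{2+\epsilon} \\ &\le 2^{1+\epsilon} k_n^{-1} \sum_{i=1}^{k_n} |\xi_{n,i}|^{2+\epsilon} + 2^{1+\epsilon} \left| k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i} \right|^{2+\epsilon} \quad [\mathsf{P}]\text{-a.s.} \end{aligned}$$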


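The right-hand side of the previously derived upper bound is uniformly bounded in 
probability $\mathsf{P}$, because of Markov’s inequality and (5). Indeed, for fixed $\eta > 0$ 

$$\mathsf{P}\left[ k_n^{-1} \sum_{i=1}^{k_n} |\xi_{n,i}|^{2+\epsilon} \ge \eta \right] \le \frac{1}{\eta}\, k_n^{-1} \sum_{i=1}^{k_n} \mathsf{E}_{\mathsf{P}} |\xi_{n,i}|^{2+\epsilon} \le \frac{1}{\eta} \sup_{\iota \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} |\xi_{\iota,1}|^{2+\epsilon} < \infty, \qquad \forall n \in \mathbb{N}$$

and 

$$\mathsf{P}\left[ \left| k_n^{-1} \sum_{i=1}^{k_n} \xi_{n,i} \right| \ge \eta \right] \le \frac{1}{\eta}\, k_n^{-1}\, \mathsf{E}_{\mathsf{P}} \left| \sum_{i=1}^{k_n} \xi_{n,i} \right| \le \frac{1}{\eta} \sup_{\iota \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} |\xi_{\iota,1}| < \infty, \qquad \forall n \in \mathbb{N}.$$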


Since $\mathsf{E}_{\mathsf{P}^*_{\xi}} \big| \xi_{n,1}^* - \mathsf{E}_{\mathsf{P}^*_{\xi}} \xi_{n,1}^* \big|^{2+\epsilon}$ is bounded in probability $\mathsf{P}$ uniformly over $n$ and the 
denominator of the right-hand side of (10) is uniformly bounded away from zero due to (6), then 
the left-hand side of (10) converges in probability $\mathsf{P}$ to zero as $n$ tends to infinity. So, (i) is 
proved as well. □ 


Proof (of Theorem A.2) According to the Cramér-Wold theorem, it is sufficient to ensure 
that all assumptions of the one-dimensional bootstrap CLT (Theorem A.1) for triangular arrays are valid 
for any linear combination of the elements of the random vector $\xi_{n,1}$, $n \in \mathbb{N}$. 
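For arbitrary fixed $t \in \mathbb{R}^q$, using the Jensen inequality, we get 

$$\sup_{n \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} \big| t^{\top} \xi_{n,1} \big|^4 \le q^3 \sup_{n \in \mathbb{N}} \sum_{j=1}^{q} t_j^4\, \mathsf{E}_{\mathsf{P}} \big| \xi_{n,1}^{(j)} \big|^4 \le q^4 \max_{j=1,\ldots,q} t_j^4 \max_{j=1,\ldots,q} \sup_{n \in \mathbb{N}} \mathsf{E}_{\mathsf{P}} \big| \xi_{n,1}^{(j)} \big|^4 < \infty.$$

Hence, assumption (7) implies assumption (5) for the random variables $\{t^{\top} \xi_{n,k_n}\}_{n \in \mathbb{N}}$. 

Similarly, assumption (8) implies assumption (6) for such an arbitrary linear 
combination, i.e., positive definiteness of the matrix $\Gamma$ yields 

$$\liminf_{n \to \infty} \operatorname{Var}_{\mathsf{P}}\, t^{\top} \xi_{n,1} = \liminf_{n \to \infty} t^{\top} \left( \operatorname{Var}_{\mathsf{P}} \xi_{n,1} \right) t = t^{\top} \left( \liminf_{n \to \infty} \Gamma_n \right) t = t^{\top} \Gamma t > 0.$$

□ 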